Plot your data.
Author
Abstract
Writers of statistics textbooks tend to copy other textbooks rather than draw on experience. This leaves a serious gap: Techniques that are highly useful in practice are not taught. With this series of columns I am trying to fill that gap. The first column [1] pointed out that doing something (imperfect) is better than doing nothing. The second [2] was about the value of transforming your data. A third neglected lesson about data analysis is plot your data. More precisely, make all reasonable graphs of your data. Make a histogram of every measurement (to see its distribution), plot every measurement against its date, and plot every measurement against every other measurement. This is a good way to generate ideas.

To read almost any statistics textbook, even the best (e.g., Box et al. [3]), you’d think science was all about testing ideas. It isn’t. Where do the tested ideas come from? Idea generation, which these books ignore, is just as important as idea testing. One of the best ways to generate new ideas worth testing, I have found, is to make many graphs of my data. It is like searching for buried treasure. New ideas worth testing are very valuable but hard to find. Only a tiny fraction (1%?) of the graphs I’ve made led to new ideas but some of those ideas had a big effect.

My graphs generated new ideas in two ways. 1) Causality. The graph suggested a cause-effect relation I hadn’t thought of. 2) Simplicity. Something turned out to be simpler than expected. Here are examples.
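As a rough illustration of the advice to "make all reasonable graphs", the sketch below lists the three kinds of graph the abstract names (a histogram of every measurement, every measurement against its date, and every measurement against every other measurement) and optionally renders them. It is not the author's code. The function names `plot_plan` and `draw`, the column names, and the choice of matplotlib are all assumptions made for the example.

```python
from itertools import combinations


def plot_plan(measurements, date_column="date"):
    """List the recommended graphs as (kind, x, y) tuples (hypothetical helper)."""
    plan = []
    for name in measurements:
        # Histogram of every measurement, to see its distribution.
        plan.append(("histogram", name, None))
        # Every measurement against its date.
        plan.append(("time_series", date_column, name))
    # Every measurement against every other measurement (each pair once).
    for x, y in combinations(measurements, 2):
        plan.append(("scatter", x, y))
    return plan


def draw(plan, data, out_dir="."):
    """Render each planned graph to a PNG file.

    Assumes matplotlib is installed and that `data` maps each column
    name to a list of values of equal length.
    """
    import os
    import matplotlib
    matplotlib.use("Agg")  # write files only; no display required
    import matplotlib.pyplot as plt

    for kind, x, y in plan:
        fig, ax = plt.subplots()
        if kind == "histogram":
            ax.hist(data[x])
            ax.set_xlabel(x)
            ax.set_ylabel("count")
        else:
            if kind == "time_series":
                ax.plot(data[x], data[y], marker="o")
            else:
                ax.scatter(data[x], data[y])
            ax.set_xlabel(x)
            ax.set_ylabel(y)
        fig.savefig(os.path.join(out_dir, f"{kind}_{x}_{y or 'dist'}.png"))
        plt.close(fig)


# Hypothetical self-tracking columns, used only to show how the plan grows.
plan = plot_plan(["weight", "sleep_hours", "mood"])
print(len(plan), "graphs planned")
```

With n measurements the plan holds n histograms, n date plots, and n(n-1)/2 scatter plots, so the number of graphs grows quickly. That fits the abstract's remark that only a small fraction of graphs lead to new ideas.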
Similar resources
Data Visualization of Outliers from a Health Research Perspective Using SAS/GRAPH and the Annotate Facility
SAS/GRAPH is a powerful tool for customizing the box plot to detect and identify outliers. This paper shows how to use the ANNOTATE facility and annotate data set to customize box plots and profile plots of outliers using data from a dietary-health study. This paper assumes: • A working knowledge of basic SAS/GRAPH procedures. • The ability to display or print graphics on your operating system...
Getting the most out of your permanent plot data
A catalogue of ideas for graphical analyses of growth data is presented, in the hope of stimulating the more imaginative analyses. Graphs can be particularly revealing, because the human eye is good at detecting patterns. Suggestions are given to make graphs more effective, and analyses more insightful.
The sequelae of misinterpretating surgical outcome data.
On a normal working day, a plot is presented to You that will change your life forever. The plot (Fig. 1A) represents your variable life-adjusted display (VLAD) curves depicted versus VLAD curves from your ‘competing’ colleagues. This VLAD curve, a real case, a real plot (the surgeon is not an author of this letter), presents a plot of your cumulative sum of the difference in expected and obser...
Much of the Solution Written by Afshin Rostami and Umar Syed
1. Implement AdaBoost with boosting stumps and apply the algorithm to the spambase data set of HW2 with the same training and test sets. Plot the average cross-validation error plus or minus one standard deviation as a function of the number of rounds of boosting T by selecting the value of this parameter out of {10, 10^2, ..., 10^k} for a suitable value of k, as in HW2. Let T* be the best va...
Information at your finger tips: Exploring the US Census Data
Figure 1: United States, Year 2000 Median Household Income – on the U.S. National Level plot there are high income clusters on the East Side of Central Park, and in suburbs of Chicago but not its downtown neighborhood. In the San Francisco area we can identify Silicon Valley; the income in this small area is significantly greater than average (Data=Block Level; Global Shape=Cartogram based on H...
Journal title:
- Nutrition
Volume 25, Issue 5
Pages: -
Publication date: 2009